feat!: store-defined concurrency limits#3547
feat!: store-defined concurrency limits#3547d-v-b wants to merge 20 commits intozarr-developers:mainfrom
Conversation
…v-b/zarr-python into feat/global-concurrency-limit
|
Nice, the other one that bugs me when profiling is zarr-python/src/zarr/codecs/zstd.py Line 79 in b3e9aed |
…at/global-concurrency-limit
|
@d-v-b do you have thoughts on how downstream libraries would interact with Zarr's concurrency limit? Would those set there own concurrency approaches (e.g., global or per function) or could there be a way to share the limit with Zarr? |
|
@maxrjones this PR is moving away from a global concurrency limit because it's actually very hard to implement a top-down concurrency control in our codebase. the new direction I'm taking with this PR is for each store to set its own concurrency limit. for stateless stores like local and remote (basically anything except zip), concurrency-sensitive applications should be able to cheaply create store instances with concurrency limits tuned to the needs of the application. |
…at/global-concurrency-limit
…at/global-concurrency-limit
|
this is ready for a look. The basic idea is that we make concurrency limits an implementation detail of a particular store and thereby remove all concurrency limiting logic from array / group routines. Some stores will have no concurrency limits (e.g., memory and local storage) and so their performance will go up. Stores that actually benefit from a concurrency limit (remote stuff) has the default limit of 10 but that's something we can play with. To cut down on boilerplate we (claude and I) define a |
Makes the concurrency limit global, rather than per-function.
I'm not emotionally attached to this PR -- it's largely the work of claude code. Lets use this as a starting point for improving on #3526